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Abstract 

Radio Frequency IDentification (RFID) systems are becoming more 
and more popular in the field of ubiquitous computing, in particular for 
objects identification. An RFID system is composed by one or more read- 
ers and a number of tags. One of the main issues in an RFID network 
is the fast and reliable identification of all tags in the reader range. The 
reader issues some queries, and tags properly answer. Then, the reader 
must identify the tags from such answers. This is crucial for most appli- 
cations. Since the transmission medium is shared, the typical problem to 
be faced is a MAC- like one, i.e. to avoid or limit the number of tags trans- 
mission collisions. We propose a protocol which, under some assumptions 
about transmission techniques, always achieves a 100% perfomance. It is 
based on a proper recursive splitting of the concurrent tags sets, until all 
tags have been identified. The other approaches present in literature have 
performances of about 42% in the average at most. The counterpart is 
a more sophisticated hardware to be deployed in the manufacture of low 
cost tags. 

1 Introduction 

In the last years, Radio Frequency Identification (RFID) systems have received 
much attention both in academia and in many industries, for applications of 
monitoring and tracking. 
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RFID systems offer a promisingly affordable, cheap and flexible solution for 
object identification. For instance, fast and reliable reading of labels (tags) at- 
tached to different objects stowed in warehouses can greatly speed up operations 
such as localization and retrieval. Among the countless applications, we can cite 
public transportation and ticketing, access control, production control, animal 
identification, inventory control, electronic payments, speed-up the checkout in 
shops, location monitoring in real-time of all tagged objects within the supply 
chain, secure operations in dangerous environments, localization of objects (like 
cars in parking lots, or books in libraries), and of people. 

RFID systems consist of networked electromagnetic readers and tags, where 
the readers try to identify the tags attached to objects. The reader is an entity 
with great computation power and memory, while tags have (very) limited com- 
putation resources. Tags have a unique ID and transmit their own ID to the 
reader for being identified. The reader must be able to identify tags as quickly 
as possible by wireless communications. 

RFID tags can be active or passive, according to their technology 1]. Ac- 
tive tags are continually powered by internal batteries, while passive tags have 
no internal power and reflect energy from the reader for communication. Con- 
sequently, active tags can be read from a greater distance than passive tags. 
Moreover, active tags have more functions than passive tags, e.g. data storage 
and sensor capabilities. Tags have to be very simple in order to be as cheap 
as possible. The lifetime of active tags is over when the battery is exhausted. 
It is non economical to replace or recharge an active tag battery, because of 
large number of small and cheap nodes. For these reasons, active tags are not 
practical for use with disposable consumer products, for instance. Passive tags 
instead rely only on RF energy induced by the electromagnetic waves emitted by 
the reader. In a typical communication sequence, the reader emits a continuous 
radio frequency wave. When a tag enters in the RF field of the reader, it re- 
ceives energy from the field. Continuous advancements in protocols and circuits 
design, make the reliability and the reading range of passive tags RFID systems 
continuously improving. Besides, their cost continues to decrease, leading to 
an increase of their applications. To meet cost, size and lifetime requirements, 
passive tags seem to be the best solution for RFID systems, and we focus on 
them in this paper. 

The limited available bandwidth forces all tags to share a common broadcast 
wireless channel. This can lead to the collision problem in signal transmission 
of the readers or the tags, which hardly leads to fast identification. As a result, 
either the reader may not recognize all objects, or a tag identification process 
may suffer from long delay. Therefore, anti-collision protocols which enable 
the fast and correct identification regardless of the occurrence of collisions are 
required. 

Collisions are divided into reader collisions [31 2] and tag collisions. Reader 
collision problems arise when multiple readers are simultaneously used [31 3] . 
The other collision problem (approached in this paper) is the tag collision one, 
that occurs when several tags try to answer to a reader query at the same time. 
The reader queries the tags for their ID through the RF communication channel, 
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by broadcasting a request message to tags. Upon receiving this message, all tags 
send an answer back to the reader. If only one tag answers, the reader identifies 
the tag. If more than one tag answer, their messages will collide on the RF 
communication channel, and the reader cannot identify these tags. This is a 
special case of the multiple-access communication problem. However, RFID 
applications introduce a more challenging aspect to the problem. Since the 
low functional power and energy constraints in each tag, it is unreasonable to 
assume that tags can communicate with each other directly, and that they can 
notice their neighboring tags or detect collisions. Tags anti-collision protocols 
are required for identification in RFID systems. A natural question is: what 
is the best communication protocol so that the tags can be read as fast and 
reliable as possible? This is known as anti-collision problem. The issue becomes 
more challenging on considering that the tags must be simple, cheap and small 
enough. There are two families of protocols for approaching the tag collision 
problem: a family of probabilistic protocols, and a family of deterministic ones. 
We will survey both families in Section El In this paper, we propose a new anti- 
collision protocol which, assuming a particular RFID system simple technology, 
achieves an optimal performance outperforming all existing protocols. 

The paper is organized as follows. In Section El we survey the known suit- 
able protocols for our problem. In Section El we describe our approach, by 
highlighting the assumptions we made, and the achieved performance, in terms 
of number of reader queries. In Section [H we assess the protocol performance. 
Conclusions (Section [5]) terminate the paper. 

2 Related Work 

The proposed protocols for tag collision resolution in RFID systems are either 
probabilistic or deterministic. The first ones are Aloha-based protocols [5j [6j IHl 
El HH El HS1 E21 ES], the last ones are tree-based protocols [H US UM EE ED 
26, 27 d . There are also hybrid approaches, where randomization is applied in 
tree schemes [T7l [T8l 125] . 

Aloha-based protocols, such as Aloha, Slotted Aloha, Framed Slotted Aloha, 
and Dynamic Framed Slotted Aloha, reduce the probability of the occurrence 
of tag collisions since tags try to answer at distinct times. 

In Framed Slotted Aloha, a frame is a time interval between requests of a 
reader and consists of a number of slots. Each tag transmits its serial number 
to the reader in a randomly selected slot of a frame, and the reader identifies 
tags when a time slot is used by one tag only. Read cycles are repeated for 
unrecognized (namely, whose transmission results in a collision) tags, until all 
tags have been identified. So, in the last read cycle there must be no collision. 

The basic Framed Slotted Aloha anti-collision protocol [8] uses a fixed frame 
size and does not change the size during the process of tag identification. The 
Dynamic Framed Slotted Aloha [Hj protocol changes the frame size dynami- 
cally. The constraint of this protocol is that the frame size cannot be increased 
indefinitely as the number of tags increases, but it has an upper bound. This 
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implies a very high number of collisions when the number of tags exceeds the 
maximum admitted frame size. The Enhanced Dynamic Framed Slotted Aloha 
(EDFSA) protocol, analyzed in [5], overcomes such problem by dividing the 
unread tags into a number of groups and interrogating each group of them re- 
spectively. The system efficiency of EDFSA protocol is improved by the Variant 
Enhanced Dynamic Framed Slotted Aloha protocol [33] where a dynamic group 
dividing approach is adopted. 

The frame size affects the performance of Aloha based algorithms. A small 
frame size results in many collisions, and so increases the required total number 
of slots when the number of tags is high. In contrast, a large frame size may 
result in more idle time slots, when the number of tags is small. Different 
methods estimating the number of unread tags, permit to the reader of choosing 
an optimal frame size for the next read cycle, are presented in [23l [30l [29] . 

In [9] , the Tree Slotted Aloha (TSA) protocol is proposed. It aims to reduce 
tag transmission collisions by querying only those tags colliding in the same slot 
of a previous frame of transmissions. At the end of each frame, for each slot 
in which a collision occurred, the reader starts a new small frame, reserved to 
those tags which collided in the same time slot. In this way, a transmission 
frame can be viewed as a node in a tree, where the root is the initial frame; 
leaves represent frames where no collision occurred. 

Aloha-based protocols can not perfectly prevent collisions. In addition, they 
have the serious problem that a tag may not be identified for an unlimited time 
(the so called tag starvation problem). 

Tree-based tag anti-collision protocols can have a longer identification delay 
than Slotted Aloha based ones, but they are able to avoid the tag starvation. 
Among tree-based protocols, there are binary search protocols pT] and query 
tree protocols [T5] . 

In query tree protocol (QT), the reader sends a prefix, and all tags having 
the ID matching the prefix, answer. If there is a collision, the reader queries for a 
one bit longer prefix, until no collision occurs. Once a tag is identified, the reader 
starts a new round of queries with another prefix. In [12j , many improvements of 
QT protocol have been presented, for increasing its performance. Among them, 
the best performing one is called Query Tree Improved [T5] . This protocol avoids 
the queries that certainly will produce collisions. 

Since tags do not need additional memory except the ID, query tree protocols 
have the advantage to be memoryless, and for this they require low functional 
and less expensive tags. However, since they use prefixes, their performance is 
sensitive to the distribution of tag IDs which a reader have to identify. 

In [26], a Prefix- Randomized Query Tree protocol is proposed, where tags 
randomly choose the prefixes rather than using their ID-based ones. The identi- 
fication time of this protocol is improved with respect to QT protocol because it 
is no longer affected by the length and distribution of tag IDs as in QT protocol. 

In binary search protocols, [TQ1 [H], the reader performs identification by 
recursively splitting the set of answering tags. Each tag has a counter initially 
set to zero. Only tags with counter set to zero answer the reader's queries. After 
each tag transmission, the reader notifies the outcome of the query: collision, 
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identification, or no-answer. When tag collision occurs, each tag with counter set 
to zero, adds a random binary number to its counter. The other tags increase 
by one their counters. In such a way, the set of answering tags is randomly 
split into two subsets. After a no-collision transmission, all tags decrease their 
counters by one. 

In [57], the bi-slotted query tree algorithm, and bi-slotted collision track- 
ing tree algorithm are proposed. They reduce the prefix overhead for fast tag 
identification. 

Myung et al. [THlHHm] proposed two adaptive tag anti-collision protocols: 
the Adaptive Query Splitting protocol which is an improvement of the Query 
Tree protocol, and the Adaptive Binary Splitting protocol which is based on 
the Binary Tree protocol. For reducing collisions, the proposed protocols use 
information obtained from the last process of tag identification, assuming that 
in most object tracking and monitoring applications the set of RFID tags en- 
countered in successive reading from a reader does not change substantially and 
information from a reading process can be used for the next one. An improve- 
ment of Adaptive Binary Splitting protocol is proposed in [28] . 

Performance for tags collision problem, is usually computed as the ratio 
between the number of tags to be identified, and the number of queries (or 
time slots, in case of Slotted Aloha based protocols) performed in the whole 
identification process. This metric is referred as system efficiency [HIS]. 

All the protocols proposed so far exhibit an average performance (both in 
terms of messages and in terms of transmitted bits, two metrics strictly related 
to system efficiency) well below 50%. Actually, the best performing protocols, 
namely QT and TSA, perform around 40% of system capacity on average, in 
their best performing versions [El [9] • Such results are reported in the above 
papers, where the protocols have been proposed, and were substantially con- 
firmed in a paper [24] describing an extensive simulation experiment set up for 
assessing the average performance of several tag identification protocols. 

In this paper we propose an optimum tag identification protocol that out- 
performs, at the best of our knowledge, all existing protocols. We assume a 
particular simple RFID system technology that we describe in Section 13.11 and 
we show in Section Q] that by using this RFID system technology, our protocol 
achieves a system efficiency of 100%, and so perfectly solves the problem. 

3 Perfect tag identification protocol 

In this section, we first state the assumptions made on the transmission tech- 
niques used by the target RFID systems. Then, we give a detailed description 
of our protocol. 

3.1 System assumptions 

In this paper, we assume that signals transmitted by multiple entities sum up 
at the receiver, with no loss of information. Besides, we assume (as usual in 
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this setting) that the signal corresponding to a bit equal to is the complement 
of that corresponding to a bit equal to 1 (this is what happens, for instance, 
when a Manchester bit coding is used). We shall call such signals and 
respectively. For instance, if we have four tags with IDs 011010, 010101, 
110001, 011111, the simoultaneous transmission of their ID will be received by 
the reader as "-2 4 2". From the collided tags answers, a reader can 
detect the exact number of transmitting tags, provided that all such tags have 
transmitted the same bit (say "1") in the same position (say, the second one). 
In this case, the maximum (in absolute value) of the received signals is equal to 
the number of transmitting tags. In the above example, all four tags transmit 
a "+1" in second position. However, this is not always the case: consider for 
example the above tags, with the first one having a "0" in second position, 
instead of a "1". Then, the signals received by the reader would be "-2 2 
2" . This sort of "capture effect" prevents from exactly knowing the number of 
answering tags. As we shall see, this is not a problem since our protocol works 
in splitting the tags in two subsets, with possibly one subset containing only 
one tag (thus identifying it). 

The important thing is that, when there are multiple answering tags, the 
received signal is not formed by "+1" and "-1" only. This happens only when 
the number of "0" bits and that of "1" bits differ by exactly one. In this case, 
we have a "false positive" , namely we identify one tag that is not present in our 
set, whose ID is given by the sum of the IDs of the colliding tags. For example, 
if IDs "0 1 1 1" , "0 1 1 0" ,"1 1 0" arc transmitted simultaneously, 
the reader will receive "00 1 0" which does not correspond to any ID of tags 
in its range. Notice that tag IDs are 48, 96, bits long, and so the probability 
of this "false positive" is extremely low. Anyway, the above problem can be 
overcome by adding a one bit prefix to each ID, so forcing each tag to transmit 
a "+1" (for instance) before their ID. The cost of this is obviously negligible, 
given the above reported ID's lengths. 

3.2 The Protocol 

Given the above assumptions, it is possible to build a protocol which always 
achieves a 100% performance. 

The protocol, called P, works as follows. Initially, the reader asks all tags 
to transmit their own complete ID. From the answer, which is the sum of all 
IDs, the reader starts to divide the set of tags in a recursive way until all 
tags are identified. The partioning is done in the following manner: the reader 
chooses the first bit position, say pi, for which the answer presents the maximum 
absolute value, say ml. Two cases can occur: 

ml = 1: 

In this case, a tag ID is recognized; 
ml > 1: 

In this case, the reader searches the first position, say p2, where a value, 
say m2, equal to the largest absolute value smaller than ml is present in 
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the previous answer. Then, it issues a new query by asking all tags having 
"1" in position p2, with the same sign of m2. Then, the reader repeats 
the above procedure with the outcome of the last query. 

When a tag ID is recognized, the reader obtains another tag ID or the sum 
of more than one tag ID, by subtracting the recognized ID from the previous 
answer, so without issuing any query. By recursively applying the previous 
procedure, the reader always splits the responding tags into two groups: one 
having in a given position, the other having in the same position. 

Since any two tag IDs differ in at least one bit position, the protocol converges 
to a complete identification of all tags. 

In Table [U we sketch the behavior of protocol P. When the reader issues 
a query, it transmits a mask of bits of the same length of tag IDs (K in the 
pseudocode), initially empty (empty value is coded with "0"). Tags have their 
IDs stored with instead of "0", and in Table [T] such an ID is referred as 
mylD. 

In Figure [TJ we show the execution of P protocol on a set of seven tags. The 
queries issued by the reader are represented with bold arrows between father 
nodes and left son nodes. Arrows between father nodes and right son nodes, 
instead, represent IDs' sums obtained by difference between the previous and 
the actual answers. As evident from the figure, the execution of P protocol is 
equivalent to a depth first search (DFS) on the tree, where the leaves represent 
the IDs of the tags to be identified. 

Notice that tag hardware is very simple: tags have only to listen the reader 
queries and answer if the received mask matches with their own ID. So tags are 
memory less. 

4 Protocol properties 

In this section, we formally show that the above protocol always achieves 100% 
performance, provided that the transmission assumptions are met. 

Lemma 1. The execution of protocol P for identifying N tags can be represented 
by a full binary tree with 2N — 1 nodes. 

Proof. A full binary tree is a tree in which every node has zero or two sons [2]. 
P works by dividing the tags set into two subsets: each subset is represented 
by a node in the full binary tree. When a set is partitioned by querying the 
tags having a given value or in a given bit position, the resulting 

two subsets are the sons of the node associated to the previous set. The subset 
related to the left son is obtained by query, while the other set (the right son) 
is obtained by difference between the previous answer (father) and the current 
answer (left son). Then, all the queries done by the reader are associated with 
all the edges between father and left son, in addition to the first query, whose 
answer is associated with the root of the full binary tree. A node associated with 
a subset with cardinality equal to one, is a leaf and it represents an identified 
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Table 1 : Pseudo-code of protocol P 



Reader procedure: 

/ * initialization * '/ 

1. for (i = 0; i < K; i + +) mask[i] = 0; 

/*ask all tags for transmitting their ID*/ 

2. broadcast (mask); 

/*put the sum of ID signals in an array called answer */ 

3. answer <— getIDs(); 

/*analyse the result*/ 

4. check(answer) . 
check(answer): 

/*flnd the maximum absolute value present in answer and put it in ml */ 

1. ml <— max absolute value in answer; 

/*if ml equals 1, a tag is identified*/ 

2. if (ml = 1) then 

3. tag with ID equal to answer is identified; 

4. return ; 

/*if ml > 1 then the reader needs more queries*/ 

5. if (ml > 1) then 

/*find the second maximum absolute value in answer and put it in m2*/ 

6. m2 <— second max absolute value in answer] 

7. p2 <— bit position corresponding to m2; 

/*s1 gets ±1 with the same sign of answer [p2]*/ 

8. if (answer[p2] > 0) then s2 <- "+1"; 

9. else s2 <- "-1"; 

/*set the bit mask before broadcasting it*/ 

10. mask[p2] — s2; 

/*issue the query*/ 

11. broadcast (mask); 

/*put the sum of ID signals in an array called new Answer * / 

12. new Answer <— getIDs(); 

/* analyse the result*/ 

13. check(new Answer); 

/*set the mask for being used in the right subtree*/ 

14. mask[p2] = —s2; 

/*analyse the sum of IDs obtained by difference*/ 

15. check(answer — new Answer). 

/*reset the mask*/ 

16. mask[p2] = 0; 

Tag procedure: 

1. while (TRUE) 

2. Wait for reader transmission; 

3. if (mylD matches mask) then transmit mylD; 
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Figure 1: Example of protocol execution. 



tag. So, the number of leaves of the tree is N (the number of tags). It is well 
known that any full binary tree with N leaves has 2 N — 1 nodes [2] • □ 

Given the above Lemma, we are able to state the following theorem. 

Theorem 1. Protocol P achieves always a 100% performance. 

Proof. Due to Leamma[TJ every execution of P for identifying N tags, is equiv- 
alent to a full binary tree with 2N — 1 nodes. Queries are as many as the links 
between fathers and left sons of the associated full binary tree, plus one for the 
root. So, if the associated full binary tree has 2N — 1 nodes, it has N — 1 left 
sons. Adding the root, we obtain that the number of queries is equal to N. 

On the other hand, a full binary tree with 2N — 1 nodes, has N leaves [2]. 
Since leaves represent subsets with cardinality equal to one, they represent the 
tags participating to the identification process. 

So, the number of queries is equal to the number of tags. This means that 
P always achieves the 100% performance. □ 



5 Conclusion and open Problems 

In this paper, we investigated the tags collision problem in RFID systems. We 
presented a novel protocol, and showed that its performance is 100%, provided 
that some assumptions on ID transmissions and receptions are met. The pro- 
tocol outperforms the currently known protocols for the same problem. As 
the most important open problem, we cite the performance evaluation of our 
protocol when the assumptions made do not hold. 
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